智能论文笔记

Exploring evolution-based & -free protein language models as protein function predictors

Mingyang Hu , Fajie Yuan , Kevin K. Yang , Fusong Ju , Jin Su , Hui Wang , Fei Yang , Qiuyang Ding

分类：人工智能

2022-06-14

大规模蛋白质语言模型（PLM）在蛋白质预测任务中的性能提高，范围从3D结构预测到各种功能预测。特别是，Alphafold（一种开创性的AI系统）可能会重塑结构生物学。但是，尚未探索超出结构预测的AlphaFold，Evoformer的PLM模块的效用。在本文中，我们研究了三个流行PLM的表示能力：ESM-1B（单序），MSA转换器（多个序列比对）和Evoformer（结构），并特别关注Evoformer。具体而言，我们旨在回答以下关键问题：（i）作为Alphafold的一部分，Evoformer是否会产生可预测蛋白质功能的表示形式？（ii）如果是的，可以替换ESM-1B和MSA转换器？（iii）这些PLM多少依赖于进化相关的蛋白质数据？在这方面，他们彼此补充吗？我们通过实证研究以及新的见解和结论来比较这些模型。最后，我们发布代码和数据集以获得可重复性。

translated by 谷歌翻译

Towards Knowledge-Intensive Text-to-SQL Semantic Parsing with Formulaic Knowledge

Longxu Dou , Yan Gao , Xuqi Liu , Mingyang Pan , Dingzirui Wang , Wanxiang Che , Dechen Zhan , Min-Yen Kan , Jian-Guang Lou

分类：自然语言处理

2023-01-03

In this paper, we study the problem of knowledge-intensive text-to-SQL, in which domain knowledge is necessary to parse expert questions into SQL queries over domain-specific tables. We formalize this scenario by building a new Chinese benchmark KnowSQL consisting of domain-specific questions covering various domains. We then address this problem by presenting formulaic knowledge, rather than by annotating additional data examples. More concretely, we construct a formulaic knowledge bank as a domain knowledge base and propose a framework (ReGrouP) to leverage this formulaic knowledge during parsing. Experiments using ReGrouP demonstrate a significant 28.2% improvement overall on KnowSQL.

translated by 谷歌翻译

Analogical Inference Enhanced Knowledge Graph Embedding

Yao Zhen , Zhang Wen , Chen Mingyang , Huang Yufeng , Yang Yi , Chen Huajun

分类：人工智能 | 自然语言处理

2023-01-03

Knowledge graph embedding (KGE), which maps entities and relations in a knowledge graph into continuous vector spaces, has achieved great success in predicting missing links in knowledge graphs. However, knowledge graphs often contain incomplete triples that are difficult to inductively infer by KGEs. To address this challenge, we resort to analogical inference and propose a novel and general self-supervised framework AnKGE to enhance KGE models with analogical inference capability. We propose an analogical object retriever that retrieves appropriate analogical objects from entity-level, relation-level, and triple-level. And in AnKGE, we train an analogy function for each level of analogical inference with the original element embedding from a well-trained KGE model as input, which outputs the analogical object embedding. In order to combine inductive inference capability from the original KGE model and analogical inference capability enhanced by AnKGE, we interpolate the analogy score with the base model score and introduce the adaptive weights in the score function for prediction. Through extensive experiments on FB15k-237 and WN18RR datasets, we show that AnKGE achieves competitive results on link prediction task and well performs analogical inference.

translated by 谷歌翻译

MultiSpider: Towards Benchmarking Multilingual Text-to-SQL Semantic Parsing

Longxu Dou , Yan Gao , Mingyang Pan , Dingzirui Wang , Wanxiang Che , Dechen Zhan , Jian-Guang Lou

分类：自然语言处理

2022-12-27

Text-to-SQL semantic parsing is an important NLP task, which greatly facilitates the interaction between users and the database and becomes the key component in many human-computer interaction systems. Much recent progress in text-to-SQL has been driven by large-scale datasets, but most of them are centered on English. In this work, we present MultiSpider, the largest multilingual text-to-SQL dataset which covers seven languages (English, German, French, Spanish, Japanese, Chinese, and Vietnamese). Upon MultiSpider, we further identify the lexical and structural challenges of text-to-SQL (caused by specific language properties and dialect sayings) and their intensity across different languages. Experimental results under three typical settings (zero-shot, monolingual and multilingual) reveal a 6.1% absolute drop in accuracy in non-English languages. Qualitative and quantitative analyses are conducted to understand the reason for the performance drop of each language. Besides the dataset, we also propose a simple schema augmentation framework SAVe (Schema-Augmentation-with-Verification), which significantly boosts the overall performance by about 1.8% and closes the 29.5% performance gap across languages.

translated by 谷歌翻译

PyPop7: A Pure-Python Library for Population-Based Black-Box Optimization

Qiqi Duan , Guochen Zhou , Chang Shao , Zhuowei Wang , Mingyang Feng , Yijun Yang , Qi Zhao , Yuhui Shi

分类：神经与进化计算

2022-12-12

In this paper, we present a pure-Python open-source library, called PyPop7, for black-box optimization (BBO). It provides a unified and modular interface for more than 60 versions and variants of different black-box optimization algorithms, particularly population-based optimizers, which can be classified into 12 popular families: Evolution Strategies (ES), Natural Evolution Strategies (NES), Estimation of Distribution Algorithms (EDA), Cross-Entropy Method (CEM), Differential Evolution (DE), Particle Swarm Optimizer (PSO), Cooperative Coevolution (CC), Simulated Annealing (SA), Genetic Algorithms (GA), Evolutionary Programming (EP), Pattern Search (PS), and Random Search (RS). It also provides many examples, interesting tutorials, and full-fledged API documentations. Through this new library, we expect to provide a well-designed platform for benchmarking of optimizers and promote their real-world applications, especially for large-scale BBO. Its source code and documentations are available at https://github.com/Evolutionary-Intelligence/pypop and https://pypop.readthedocs.io/en/latest, respectively.

translated by 谷歌翻译

Magic: Multi Art Genre Intelligent Choreography Dataset and Network for 3D Dance Generation

Ronghui Li , Junfan Zhao , Yachao Zhang , Mingyang Su , Zeping Ren , Han Zhang , Xiu Li

分类：计算机视觉

2022-12-07

Achieving multiple genres and long-term choreography sequences from given music is a challenging task, due to the lack of a multi-genre dataset. To tackle this problem,we propose a Multi Art Genre Intelligent Choreography Dataset (MagicDance). The data of MagicDance is captured from professional dancers assisted by motion capture technicians. It has a total of 8 hours 3D motioncapture human dances with paired music, and 16 different dance genres. To the best of our knowledge, MagicDance is the 3D dance dataset with the most genres. In addition, we find that the existing two types of methods (generation-based method and synthesis-based method) can only satisfy one of the diversity and duration, but they can complement to some extent. Based on this observation, we also propose a generation-synthesis choreography network (MagicNet), which cascades a Diffusion-based 3D Diverse Dance fragments Generation Network (3DGNet) and a Genre&Coherent aware Retrieval Module (GCRM). The former can generate various dance fragments from only one music clip. The latter is utilized to select the best dance fragment generated by 3DGNet and switch them into a complete dance according to the genre and coherent matching score. Quantitative and qualitative experiments demonstrate the quality of MagicDance, and the state-of-the-art performance of MagicNet.

translated by 谷歌翻译

Focus! Relevant and Sufficient Context Selection for News Image Captioning

Mingyang Zhou , Grace Luo , Anna Rohrbach , Zhou Yu

分类：计算机视觉 | 自然语言处理

2022-12-01

News Image Captioning requires describing an image by leveraging additional context from a news article. Previous works only coarsely leverage the article to extract the necessary context, which makes it challenging for models to identify relevant events and named entities. In our paper, we first demonstrate that by combining more fine-grained context that captures the key named entities (obtained via an oracle) and the global context that summarizes the news, we can dramatically improve the model's ability to generate accurate news captions. This begs the question, how to automatically extract such key entities from an image? We propose to use the pre-trained vision and language retrieval model CLIP to localize the visually grounded entities in the news article and then capture the non-visual entities via an open relation extraction model. Our experiments demonstrate that by simply selecting a better context from the article, we can significantly improve the performance of existing models and achieve new state-of-the-art performance on multiple benchmarks.

translated by 谷歌翻译

Relational Message Passing for Fully Inductive Knowledge Graph Completion

Yuxia Geng , Jiaoyan Chen , Jeff Z. Pan , Mingyang Chen , Song Jiang , Wen Zhang , Huajun Chen

分类：人工智能

2022-10-08

In knowledge graph completion (KGC), predicting triples involving emerging entities and/or relations, which are unseen when the KG embeddings are learned, has become a critical challenge. Subgraph reasoning with message passing is a promising and popular solution. Some recent methods have achieved good performance, but they (i) usually can only predict triples involving unseen entities alone, failing to address more realistic fully inductive situations with both unseen entities and unseen relations, and (ii) often conduct message passing over the entities with the relation patterns not fully utilized. In this study, we propose a new method named RMPI which uses a novel Relational Message Passing network for fully Inductive KGC. It passes messages directly between relations to make full use of the relation patterns for subgraph reasoning with new techniques on graph transformation, graph pruning, relation-aware neighborhood attention, addressing empty subgraphs, etc., and can utilize the relation semantics defined in the ontological schema of KG. Extensive evaluation on multiple benchmarks has shown the effectiveness of techniques involved in RMPI and its better performance compared with the existing methods that support fully inductive KGC. RMPI is also comparable to the state-of-the-art partially inductive KGC methods with very promising results achieved. Our codes and data are available at https://github.com/zjukg/RMPI.

translated by 谷歌翻译

MetaDIP: Accelerating Deep Image Prior with Meta Learning

Kevin Zhang , Mingyang Xie , Maharshi Gor , Yi-Ting Chen , Yvonne Zhou , Christopher A. Metzler

分类：计算机视觉

2022-09-18

深图像先验（DIP）是一种最近提出的技术，用于通过将重建图像拟合到未经训练的卷积神经网络的输出中来解决成像反问题。与预处理的前馈神经网络不同，相同的倾角可以概括为任意逆问题，从降级到阶段检索，同时在每个任务下提供竞争性能。DIP的主要缺点是，虽然前馈神经网络可以在单个通行证中重建图像，但DIP必须以大量的计算成本逐渐更新数百到数千个迭代的权重。在这项工作中，我们使用元学习来大规模加速基于倾斜的重建。通过学习浸入权重的适当初始化，我们证明了在一系列逆成像任务中的运行时间有10倍的改善。此外，我们证明了一个经过训练以快速重建面孔的网络也将其推广以重建自然图像贴片。

translated by 谷歌翻译

SuperLine3D: Self-supervised Line Segmentation and Description for LiDAR Point Cloud

Xiangrui Zhao , Sheng Yang , Tianxin Huang , Jun Chen , Teng Ma , Mingyang Li , Yong Liu

分类：计算机视觉

2022-08-03

电线杆和建筑物边缘经常是城市道路上可观察到的对象，为各种计算机视觉任务提供了可靠的提示。为了重复提取它们作为特征并在离散激光镜头框架之间进行注册，我们提出了第一个基于学习的功能分割和LIDAR点云中3D线的描述模型。为了训练我们的模型，而无需耗时和乏味的数据标记过程，我们首先生成了目标线基本外观的合成原始图，并构建一个迭代线自动标记的过程，以逐步完善真实激光扫描的线路标签。我们的分割模型可以在任意规模的扰动下提取线，我们使用共享的EDGECONV编码层共同训练两个分割和描述符头。基于模型，我们可以在没有初始转换提示的情况下构建一个高度可用的全局注册模块，用于点云注册。实验表明，我们基于线的注册方法对基于最先进的方法的方法具有很高的竞争力。我们的代码可在https://github.com/zxrzju/superline3d.git上找到。

translated by 谷歌翻译